AD-A137  960  DEALS  AMONG  RATIONAL  AGENTS(U)  STANFORD  UNIV  CA  DEPT  OF  171 
COMPUTER  SCIENCE  J  S  ROSENSCHEIN  ET  AL.  MAR  85 
STAN-CS-85-1842  N08029-82-C-8126 

UNCLASSIFIED  F7G  1271  NL 


JUKI  AD-A157  960 


March  1985 


CLEARED  Report  r< 

top  publication  Alson 

JUL  3  0  1985  19 

directorate  for  freeoom  of  ihfor  mation 

ANO  StCURlTV  REVIEW  (OASD-PA) 

DEPARTMENT  OF  DEFENSE 

0F  material  does  not  IMPLY 

of  DEFENSE  INDORSEMENT  of 
FACTUAL  ACCURACY  OR  OPINION. 

Deals  Among  Rational  Agents 


Report  No.  ST  AN-CS-85* 1042 
Also  numbered:  HPP-84-44 


Jeffrey  S.  Roscnschein 
Michael  R.  Gcncscrcth 


Contract  N00039-83-C-0136 


Department  of  Computer  Science 

Stanford  University 
Stanford,  CA  94305 


APP3OVE0  SASE? 

DISTRIBUTION  IS  U.  S.l.'AlTED  (A) 


Stanford  Heuristic  Programming  Project 
Report  No.  HPP-84-44 


stub 


December  1984 


Deals  Among  Rational  Agents 


Jeffrey  S.  Rosenschein 
Michael  R.  Genesereth 


COMPUTER  SCIENCE  DEPARTMENT 
Stanford  University 
Stanford,  California  94305 


ta;; 

•I 


i 


{  By - 

I  Distribution/ 

Availability  Codes 

|Av;.il  r.n  :/or 
Dist  I  rv'Oc' al 


Deals  Among  Rational  Agents 


Abstract 


— — A  formal  framework  is  presented  that  models  communication  and  promises  in  multi¬ 
agent  interactions.  This  framework  generalizes  previous  work  on  cooperation  without 
communication,  and  shows  the  ability  of  communication  to  resolve  conflicts  among  agents 
having  disparate  goals.  Using  a  deal-making  mechanism,  agents  are  able  to  coordinate  and 
cooperate  more  easily  than  in  the  communication-free  model.  In  addition,  there  are  certain 


types  of  interactions  where  communication  makes  possible  mutually  beneficial  activity  that 


is  otherwise  impossible  to  coordinate. 

h.sr-jy,--  (  [A?* 

§1.  Introduction 


ate. 
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1.1  Artificial  Intelligence  and  the  Multi- Agent  Paradigm 

Research  in  artificial  intelligence  has  focused  for  many  years  on  the  problem  of  a  single 
intelligent  agent.  This  agent,  usually  operating  in  a  relatively  static  domain,  was  designed 
to  plan,  navigate,  or  solve  problems  under  certain  simplfying  assumptions,  most  notable 
of  which  was  the  absence  of  other  intelligent  entities. 

The  presence  of  multiple  agents,  however,  is  an  unavoidable  condition  of  the  real 
world.  People  must  plan  actions  taking  into  account  the  potential  actions  of  others,  which 
might  be  a  help  or  a  hindrance  to  their  own  activities.  In  order  to  reason  about  others’ 
actions,  a  person  must  be  able  to  model  their  beliefs  and  desires. 

The  artificial  intelligence  community  has  only  lately  come  to  address  the  problems 
inherent  in  multi-agent  activity.  A  community  of  researchers,  working  on  distributed 
artificial  intelligence  (DAI),  has  arisen.  Even  as  they  have  begun  their  work,  however, 
these  researchers  have  added  on  a  new  set  of  simplifying  assumptions  that  severely  restrict 
the  applicability  of  their  results. 

1.2  Benevolent  Agents 

Virtually  all  researchers  in  DAI  have  assumed  that  the  intelligent  agents  in  their 
domain  have  identical  or  non-conflicting  goals.  Work  has  thus  proceeded  on  the  question 
of  how  these  agents  can  best  help  one  another  in  carrying  out  their  common  tasks  [3,  4, 
6,  7,  24],  or  how  they  can  avoid  interference  while  using  common  resources  [10,  11].  The 
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rationale  for  studying  multiple  agent  interaction  stems  from  a  desire  for  increased  system 
efficiency  or  increased  capabilities.  For  example,  it  is  hoped  that  if  a  group  of  agents  carry 
out  a  task  cooperatively,  the  task  will  take  less  time  than  if  it  were  performed  by  a  single 
agent. 

Of  course,  when  there  is  no  conflict,  there  is  no  need  to  study  the  wide  range  of 
interactions  that  can  occur  among  intelligent  agents.  All  agents  are  fundamentally  assumed 
to  be  helping  one  another,  and  will  trade  data  and  hypotheses  as  well  as  carry  out  tasks 
that  are  requested  of  them.  We  call  this  aspect  of  the  paradigm  the  benevolent  agent 
assumption. 

1.3  Interactions  of  a  More  General  Nature 

In  the  real  world,  agents  are  not  necessarily  benevolent  in  their  dealings  with  one 
another.  Each  agent  has  its  own  set  of  desires  and  goals,  and  will  not  necessarily  help 
another  agent  with  information  or  with  actions.  Of  course,  while  conflict  among  agents 
exists,  it  is  not  total.  There  is  often  potential  for  compromise  and  mutually  beneficial 
activity.  Previous  work  in  distributed  artificial  intelligence,  bound  as  it  has  been  to  the 
benevolent  agent  assumption,  has  generally  been  incapable  of  handling  these  types  of 
interactions. 

Intelligent  agents  capable  of  interacting  even  when  their  goals  arc  not  identical  would 
have  many  uses.  For  example,  autonomous  land  vehicles  (ALV’s),  operating  in  a  combat 
environment,  can  be  expected  to  encounter  both  friend  and  foe.  In  the  latter  case  there 
need  not  be  total  conflict,  and  in  the  former  there  need  not  be  an  identity  of  interests.  One 
can  imagine  ALV’s  from  two  different  battalions,  or  from  two  NATO  allies,  meeting  and 
having  different  goals  (though  of  course  a  certain  amount  of  cooperation  would  be  called 
for).  Even  encounters  with  the  enemy  may  have  the  potential  for  mutually  beneficial 
compromise  (for  example,  each  may  find  it  advantageous  to  tcmporarilty  avoid  combat). 

Other  domains  in  which  general  interactions  are  prevalent  are  resource  allocation  and 
management  tasks.  An  automated  secretary  [12],  for  example,  may  be  required  to  coordi* 


nate  a  schedule  with  another  automated  (or  human)  secretary,  while  properly  representing 
the  desires  of  its  owner.  The  full  capability  to  negotiate,  to  compromise  and  promise, 
would  be  highly  desirable  in  these  types  of  encounters. 

Finally,  even  in  situations  where  all  agents  in  theory  have  a  single  goal,  the  complexity 
of  interaction  might  be  better  handled  by  a  framework  that  recognizes  and  resolves  sub* 
goal  conflict  in  a  general  manner.  For  example,  robots  involved  in  the  construction  of  a 
space  station  are  fundamentally  motivated  by  the  same  goal;  in  the  course  of  construction, 
however,  there  may  be  many  minor  conflicts  caused  by  occurrences  that  cannot  fully  be 
predicted  (e.g.,  fuel  running  low,  drifting  of  objects  in  space).  The  building  agents,  each 
with  a  different  task,  could  then  negotiate  with  one  another  and  resolve  conflict. 

1.4  Game  Theory’s  Model  and  Extensions 

In  modeling  the  interaction  of  agents  with  potentially  diverse  goals,  we  borrow  the 
simple  construct  of  game  theory,  the  payoff  matrix.  Consider,  for  example,  the  following 
matrix: 


A 

B 

A 

3/1 

2 

B 

2/5 

o/i 

The  first  player  is  assumed  to  choose  one  of  the  two  rows,  while  the  second  simulta¬ 
neously  picks  one  of  the  two  columns.  The  row/column  outcome  determines  the  payoff  to 
each;  for  example,  if  the  first  player  picks  row  B  and  the  second  player  picks  column  A, 
the  first  player  receives  a  payoff  of  2  while  the  second  receives  a  payoff  of  5.  If  the  choice 
results  in  an  identical  payoff  for  both  players,  a  single  number  appears  in  the  square  (e.g., 
the  A/D  payoff  above  is  2  for  both  players).  Payoffs  designate  utility  to  the  players  of  a 
particular  joint  move  [18]. 

Game  theory  addresses  the  issues  of  what  moves  a  rational  agent  will  make,  given 
that  other  agents  arc  also  rational.  We  wish  to  remove  the  a  priori  assumption  that 
other  agents  will  necessarily  be  rational,  while  at  the  same  time  formalizing  the  concept  of 


rationality  in  various  ways  (and  with  greater  precision  than  is  generally  done  in  the  game 
theory  literature). 

Our  model  in  this  paper  allows  communication  among  the  agents  in  the  interaction, 
and  allows  them  to  make  binding  promises  to  one  another.  We  will  consider  a  variety  of 
assumptions  about  the  rationality  of  the  agents,  as  to  how  they  decide  on  both  moves  and 
promises  to  make.  The  formalism  handles  the  case  of  agents  with  disparate  goals  as  well 
as  the  case  of  agents  with  identical  goals. 

§2.  Notation 

We  expand  on  the  notation  developed  in  [8].  For  each  game  there  is  a  set  P  of  players 
and,  for  each  player  i  6  P,  a  set  M,  of  possible  moves  for  t.  For  S  C  P,  we  denote  P  —  S 
by  S,  and  write  t  instead  of  {*}  (so  t  =  P  —  {t}).  We  write  Ms  for  flies 

We  denote  by  ms  an  element  of  M<j;  this  is  a  joint  move  for  the  players  in  S.  To 
ms  €  Ms  and  m§  €  M§  correspond  an  element  m  of  Mp.  The  payoff  function  for  a  game 
is  a  function 

p  :  P  x  Mp  — ►  R 

whose  value  at  (*,  rri)  is  the  payoff  for  player  *  if  move  m  is  made. 

Each  agent  is  able  to  specify  a  set  of  joint  moves  (i.e.,  elements  of  Mp)  that  specify 
outcomes  the  agent  is  willing  to  accept;  this  set  is  called  an  offer  group.  If  any  move 
or  moves  offered  by  one  agent  are  likewise  offered  by  all  other  agents,  this  set  of  moves 
constitutes  the  deal  (i.e.,  the  deal  is  the  intersection  of  all  the  agents’  offer  groups).  In 
practice,  a  single  clement  of  the  deal  set  will  be  selected  by  a  fair  arbiter,  and  the  result 
of  the  selection  communicated  to  all  agents.  At  that  point,  the  agents  are  all  compelled 
to  carry  out  their  part  of  the  move.  Of  course,  if  the  deal  set  has  only  one  member,  no 
arbiter  is  needed. 

We  now  define  a  secondary  payoff  function  pay(i,mt,  Dt),  the  set  of  possible  payoffs 


to  t  of  making  move  m*  and  suggesting  offer  group  P*: 

{{p(t,  d)  :d  €  Pi  A  3  Or  (Of  €  allowed0[i,  Pi)  A  d  €  Or)},  if  such  a  d 

exists; 

{p(*,m)  :  mi  €  allowedm{i, m,)},  otherwise. 

all  owed  mi},  mt)  is  the  set  of  moves  that  other  agents  might  potentially  make  while  t  makes 
move  mt,  and  allowcd0(i,Di )  is  the  set  of  deals  that  other  agents  might  make  while  i 
suggests  offer  group  D,.  Our  formalism  implicitly  separates  offer  groups  from  moves  (i.e., 
there  will  be  no  effect  on  moves  by  offer  groups  or  vice  versa).  Intuitively,  this  reflects 
simultaneously  revealing  one’s  move  and  offer  group,  with  one’s  eventual  action  determined 
by  others’  offer  groups  (that  is,  only  if  there  is  no  agreement  will  you  have  to  carry  out 
your  move).  Future  work  might  investigate  the  situation  where  offers  are  made  before 
moves  are  chosen,  and  may  thus  affect  them. 

For  nonempty  sets  {a,}  and  {/?_,},  we  write  {a,}  <  {/?,}  if  a,  <  (3,  for  all  i,j  (and 
say  that  {/?_, }  strictly  dominates  {<*,})•  Likewise,  we  write  {a,}  <  {/?,}  for  nonempty  sets 
{a,}  and  {/?,}  if  a,  <  for  all  i,j  and  the  inequality  is  strict  in  at  least  one  case.  We 
then  say  that  {/?7}  dominates  {ay}. 

Finally,  we  define  p(S, d«;)  as  {p(t, d)  :  i  &  S  A  d§  E  Mg}.  These  are  the  possible 
payoffs  to  a  group  S  of  players  of  making  move  dg- 

2.1  Rational  Moves 

We  will  denote  by  R,„(p,t)  the  set  of  rational  moves  for  agent  t  in  game  p.  We  use 
the  following  definition  to  constrain  what  moves  are  elements  of  Rm{p, i)  (i.e.,  what  moves 
are  rational): 

pay{i,d„0)  <  poy(i',c,,0)  d,  £  R„,(p,t).  (1) 

In  other  words,  if,  when  no  binding  agreement  will  be  reached,  every  possible  payoff  to 
i  of  making  move  d,  is  less  than  every  possible  payoff  to  t  of  making  move  c,,  then  d,-  is 
irrational  for  t.  Of  course,  this  docs  not  imply  that  c,  is  rational,  since  better  moves  may 
still  be  available. 


In  general,  it  will  not  be  possible  to  fully  specify  the  value  of  pay(t,m,,0)  for  all  m„ 
since  there  is  not  full  information  as  to  the  moves  that  the  other  agents  will  make.  Instead, 
we  use  (1)  to  show  that  some  moves  are  not  rational. 

2.2  Rational  Offer  Groups 

We  define  a  rational  offer  group  in  a  way  analogous  to  how  we  defined  a  rational  move 
above.  We  denote  by  R0(p,i)  the  set  of  rational  offer  groups  for  agent  t  in  game  p,  and 
characterize  a  rational  offer  group  by  the  following  constraint  on  /20(p,t)’s  members: 

3m,- [pay  (t,  m„  P.)  <  pay(i,  m„  0,)]  =>  P,  £  P0(p,»).  (2) 

In  other  words,  if  for  some  move  m,-  every  possible  payoff  resulting  from  offer  group  P<  is 
less  than  every  possible  payoff  resulting  from  offer  group  Oj,  then  Pi  is  not  a  rational  offer 
group. 

There  is  one  additional  constraint  on  members  of  R0(p,t):  rational  offer  groups  specify 
(through  .the  function  p)  a  continuous  range  of  payoffs  that  are  acceptable  to  an  agent. 
Intuitively,  a  rational  offer  group  must  reflect  the  notion  of  “monotonic  satisfaction” — if  a 
rational  agent  is  satisfied  with  a  particular  payoff,  he  will  be  satisfied  with  one  of  equal  or 
greater  value  (this  is  a  fundamental  meaning  of  “utility”).  Formally,  we  write 

[p(*i  r)  <  p(t,  s)  A  f  G  O.J  =>  s  e  0,  (3) 

i 

for  all  O,  R„{p,  i)  and  moves  f  and  s.  For  a  particular  game  and  player,  a  rational  offer 
group  can  thus  be  unambiguously  specified  by  any  of  its  members  with  the  lowest  payoff. 

In  general,  there  may  be  more  than  one  rational  offer  group  for  an  agent  in  a  game. 
If  full  information  were  available  to  an  agent  about  the  offers  others  were  going  to  make 
(along  with  their  “backup  moves”),  it  would  be  trivial  to  determine  R„(p,i).  In  practice 
such  information  is  not  available,  but  a  rational  agent  i  may  be  able  to  discover  some 
rational  offer  group,  i.e.,  some  offer  group  provably  in  R„(pti). 

2.3  Rational  Moves  and  Offer  Groups  for  a  Set  of  Players 

We  also  wish  to  define  the  rational  moves  and  the  rational  offer  groups  available  to  a 
set  of  players.  For  S  C  P,  we  denote  by  il,n(p,5)  the  rational  moves  for  the  group  S  in 


the  game  p.  It  follows  that  the  members  of  Rm(p,  S)  are  elements  of  Ms-  We  assume  that 

Rm{p,S)  C  Rm(ptSt)  x  Ms-s’  for  5'  C  S. 

This  states  that  no  rational  move  for  a  set  can  require  irrationality  on  the  part  of  a  subset. 
An  obvious  consequence  of  this  assumption  is  that 

■Rm(P)5)  C  J"J  Am(p, t). 

*es 

A  move  that  is  rational  for  a  group  of  players  is  thus  rational  for  each  player  in  the  group. 

Similarly,  we  denote  by  R0{p,  S)  the  set  of  rational  deals  for  S  in  the  game  p  (that 
is,  the  members  of  R0(p,S)  are  sets  of  elements  from  A/p).  It  is  the  “crossproduct- 
intersection”  of  rational  offer  groups  for  the  individual  agents: 

rt0(p,S)  =  {O  :  O  =  p|  Oj  A  Oj  €  R„(pJ)}- 
jes 

2.4  Rationality  Assumptions 

The  value  of  pay(t,m<,  £),)  will  depend,  of  course,  on  the  values  of  allowedm[i,mi) 
and  allow  ed0{i,Di)  (i.e.,  the  moves  and  the  deals  that  other  agents  can  make).  In  order 
to  constrain  the  value  of  pay,  we  now  define  each  of  the  allowed  functions  ( allowedm  is 
defined  as  in  [8]). 

1.  Minimal  move  rationality:  af/tw;e(im(t,mj)  =  A/;.  Each  player  assumes  that  the 
others  may  be  moving  randomly. 

2.  Separate  move  rationality:  allowed„t{i,mi)  G  Rrn(p,i).  Each  player  assumes  that 
the  others  are  moving  rationally. 

3.  Unique  move  rationality:  For  all  m,  and  m',  allow edrn(i,mi)  =  af/otwedm(*,m') 
and  \allowedTn(i, m,)|  =  1.  Each  player  assumes  that  the  others’  moves  are  fixed  in 
advance.  This  may  be  combined  with  separate  rationality. 

The  assumptions  above  do  not  fully  specify  what  is  or  is  not  a  rational  move.  Rather, 
they  help  constrain  the  set  of  rational  moves  by  allowing  us  to  prove  that  certain  moves 
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are  not  rational.  We  now  define  analogous  assumptions  regarding  deals  other  agents  might 
be  making: 

1.  Minimal  deal  rationality:  allowcd0(i,Di)  C  P(Mp),  where  P(Mp)  denotes  the 
power  set  of  Mp.  Each  player  assumes  that  the  others  may  be  making  random  deals. 

2.  Separate  deal  rationality:  allowed0(i,  A)  C  Ro(p »*)•  Each  player  assumes  that 
the  others  are  making  rational  deals. 

3.  Unique  deal  rationality:  allowed0(t,  Di)  =  allowed0(i,  Ei)  and  \allowed0(t,  2?,)|  = 
1  for  all  D,  and  Et.  Each  player  assumes  that  the  others'  offers  are  fixed  in  advance. 
This  may  be  combined  with  separate  deal  rationality. 

We  will  refer  to  the  combination  of  separate  and  unique  move  rationality  as  indi¬ 
vidual  move  rationality,  and  to  the  combination  of  separate  and  unique  deal  rationality 
as  individual  deal  rationality.  As  in  [8],  any  move  that  can  be  proven  irrational  under 
the  assumption  of  minimal  move  rationality  will  be  similarly  irrational  under  the  other 
move  rationality  assumptions.  Analogously,  any  offer  group  that  can  be  proven  irrational 
under  the  assumption  of  minimal  deal  rationality  will  be  irrational  under  the  other  deal 
rationality  assumptions. 

§3.  Rational  Deal  Characteristics 

With  our  notational  conventions  defined,  we  now  prove  several  characteristics  of 
R„(p,i).  We  will  henceforth  use  a  to  denote  any  move  that  gives  agent  *  his  highest 
payoff. 

Theorem  1  (Existence  of  a  non-null  rational  offer  group).  |/?0(p,t)|  >  1. 

Proof.  If  /?„(;>,  t)  were  empty  then  i  would  do  best  by  making  no  offers  and  relying  on  his 
move  to  generate  his  payoff.  But  pay(i,mi,s)  will  be  greater  than  or  equal  to  pat/(*,m,,0) 
for  all  mi  (since  a  will  cither  be  matched  by  other  agents,  increasing  t’s  payoff,  or  will  not 
be  matched,  and  will  therefore  be  harmless  since  it  doesn’t  affect  other’s  moves).  Thus 
the  offer  group  {a  }  would  also  be  in  R„(p,i),  guaranteeing  it  to  have  at  least  one  non-null 
member.  □ 


It  follows  directly  from  the  definition  of  a  rational  offer  group  (3)  that  all  non-empty 
members  of  t’s  set  of  rational  offer  groups  include  a.  Together  with  Theorem  1,  this  implies 
that  it  is  always  rational  for  an  agent  to  include  in  his  offer  group  the  move  that  gives  him 
his  highest  payoff. 

In  addition,  an  agent  can  often  restrict  his  offers  to  those  whose  payoffs  are  higher 
than  that  which  he  can  get  by  making  the  null  offer,  relying  on  his  move  to  give  him  this 
payoff. 

Theorem  2  (Lower  bound).  Assuming  unique  deal  rationality,  if  for  any  move  m,  and 

— ♦ 

joint  move  d  ^  s. 

p(i,d)  <  pay(t,m,,0), 

30i[0i  e  R0(p,i)  A  d  <£Oi\. 

Proof.  There  are  two  cases: 

1.  p(i,d)  <  pay(t, m,, 0):  The  only  way  for  d  to  be  in  some  rational  offer  group  P,-  is 
for  the  d  deal  not  to  be  accepted  (otherwise  pay(t,  mt,  Pt)  would  be  dominated  by 
the  offer  group  pay[i,mi,Oi)  where  O,-  =  {c  :  p(t,c)  >  p(t,  <?)}).  But  if  d  is  not 
accepted,  then  it  is  equivalent  to  another  offer  group  that  includes  only  those  moves 
with  payoffs  higher  than  d.  This  smaller  offer  group  will  then  also  be  in  R0(p,i). 

-4  *4 

2.  p(t,d)  =  pay(t,m,,0):  Assume  that  d  is  in  some  rational  offer  group  Pt.  If  d  is  not 

accepted,  or  is  accepted  along  with  other  offers,  then  pay(i,m,,Oi)  >  pay(i,rrii,  Pi) 
where  O*  =  {<?  :  p(i,  c)  >  p(t,d)},  so  there  is  another  rational  offer  group  (namely  Oi) 
without  d.  If  d  is  the  only  accepted  offer,  then  pay(i,mi,  s)  =  pay(i,rrii,  Px)  (where 
S  is  the  move  that  gives  t  his  highest  payoff),  since  s  will  not  be  accepted  anyway 
and  therefore  pay(t, m,,  s)  —  pny{i,  m,,0).  Again,  there  is  a  rational  offer  group  that 
does  not  include  d .  □ 

Note  that  Theorem  2  will  not  hold  for  s  (i.e.,  the  joint  move  that  gives  *  his  highest 
payoff)  since  that  would  contradict  Theorem  1  (Theorem  2’s  proof  makes  implicit  use  of 
the  fact  that  d  ^  s  in  its  construction  of  the  dominating  offer  group  Ot).  Note  also  that 
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Theorem  2  will  not  hold  under  minimal  deal  rationality.  Imagine  that  a  perverse  opponent 
chooses  his  offer  group  as  follows: 

1.  If  you  include  in  your  offer  group  deals  with  low  payoff  (for  you),  he  will  accept  the 
deal  with  your  best  payoff; 

2.  If  you  don’t  offer  that  low  deal  he  will  accept  no  deals  and  you  will  have  to  rely  on 
your  move  to  get  a  payoff. 

Under  these  circumstances  (fully  consistent  with  minimal  deal  rationality),  it  might  be  to 
your  advantage  to  offer  a  low-payoff  deal,  since  that  might  be  the  only  way  to  get  your 
maximal  payoff. 

3.1  Restricted  Case  Analysis 

The  consequences  of  Theorem  2  will  differ,  of  course,  based  on  assumptions  about 
allowedm  since  these  will  affect  pay(i, m^,  0)  for  any  given  rrij.  Consider  the  following 
payoff  matrix: 


A 

B 

A 

1/4 

0/5 

B 

3/2 

2/7 

It  is  shown  in  [8]  that,  assuming  minimal  move  rationality  (potentially  random  or  even 
malevolent  moves  by  other  agents),  the  row  agent  can  still  use  “restricted  case  analysis” 
to  constrain  his  move  to  D.  If  unique  deal  rationality  can  be  assumed  then  the  offer  group 
consisting  solely  of  move  B/A  (i.e.,  bottom  left  corner)  is  guaranteed  by  Theorems  1  and 
2  to  be  a  rational  offer  group.  Of  course,  there  may  be  other  rational  offer  groups,  for 
example  the  offer  {B/B,  B/A),  depending  on  what  deals  the  other  player  can  offer. 

We  formalize  part  of  the  above  discussion: 

Corollary  3  (Restricted  case  analysis).  Assuming  minimal  move  rationality  and 
unique  deal  rationality,  if  for  some  c,  and  d,,  for  all  c,  and  dr, 

p(t>  d)  <  p(t,c), 
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then  there  exists  an  O,-  G  R0(p, *’)  such  that  no  d  is  in  O,-. 
Proof.  Follows  from  Lemma  3  in  [8]  and  Theorem  2.  □ 


3.2  Case  Analysis  and  Iterated  Case  Analysis 

Restrictions  on  rational  offer  groups  analogous  to  Corollary  3  apply  for  case  analysis 
and  iterated  case  analysis  under  the  assumptions  of  unique  and  individual  move  rationality, 
respectively.  The  case  analysis  situation  is  represented  in  the  following  payoff  matrix,  seen 
from  the  row  player’s  perspective: 


A 

B 

A 

4/1 

2 

B 

3/5 

0/1 

The  row  player  need  only  assume  that  the  other  player’s  move  will  not  be  affected 
by  his  own  move  (i.e.,  unique  move  rationality)  to  realize  that  making  move  A  is  in  all 
circumstances  superior  to  making  move  B.  As  long  as  unique  deal  rationality  can  also  be 
assumed,  there  is  a  guaranteed  rational  offer  group  consisting  only  of  move  A/ A. 

Corollary  4  (Case  analysis).  Assuming  unique  move  rationality  and  unique  deal  ratio¬ 
nality,  if  for  some  Cj  and  d,,  for  all  c;  and  d,  with  c*  =  dit 

p(i,d)  <p(t,c). 

then  there  exists  an  O,  G  such  that  no  d  is  in  O,-. 

Proof.  Follows  from  Lemma  4  in  [8]  and  Theorem  2.  □ 

Similarly,  if  the  column  player  can  assume  that  the  row  player  is  rational  and  making 
moves  independent  of  the  column  player’s  moves  (i.e.,  individual  move  rationality),  then 
he  can  prove  that  move  B  is  optimal  in  the  above  matrix  (since  the  row  player  will  play 
A).  With  unique  deal  rationality,  he  has  a  guaranteed  ration<al  offer  group  of  A/B. 

The  effect  of  Theorems  1  and  2  is  to  show  us  that  there  is  always  a  rational  offer 
group  that  includes  an  agent’s  highest  payoff  outcome,  and  includes  no  outcomes  below  or 

II 


equal  to  what  he  could  achieve  without  deals.  Below,  we  consider  other  constraints  on  an 
agent’s  rational  offer  groups. 

§4.  The  Group  Rationality  Theorem 

The  work  in  [8]  and  [9j  was  concerned  with  the  formalization  of  cooperative  behav¬ 
ior,  given  certain  constraints  about  the  agents  participating  in  an  interaction.  Using  our 
notation,  a  desirable  general  result  would  have  been 

pay(P,  d,  0)  <  pay(P ,  c  ,  0)  =>  d  $  £„,(?,  P),  (4) 

that  is,  if  any  joint  move  for  all  players  is  dominated  by  any  other,  then  the  dominated 
joint  move  is  not  rational  for  them.  This  result  could  not  be  proven,  and  the  inability  to 
do  so  stemmed  directly  from  the  lack  of  communication  inherent  in  the  model.  Without 
at  least  minimal  communication  (e.g.,  self-identification),  there  is  no  way  to  coordinate  on 
a  universally  perceived  best  move  when  several  such  moves  exist. 

We  are  now  able  to  derive  an  important  result  about  R0(p,P)  very  similar  to  the 
elusive  non-coinmunication  result  in  (4). 

Theorem  5  (Group  offers).  Assuming  individual  deal  rationality, 
p(P,d)  <  p(P,  c)  =>  BOj  [Oi  E  R„(p,i)  A  d  <£  Oi\ 

for  alii  E  P. 

Proof.  There  are  two  possible  cases: 

1.  VOj  [Oj  E  R„[p,i)  =>  d  £  Oj ]:  Since  d  will  not  be  a  consummated  deal,  if  Pi  is  any 
offer  group  containing  d  then  pay(t,mi,Ol)  >  pay(i,mt,  P,)  where  O,  =  {c  :  p(i,c)  > 
p(t,d)}.  Along  with  Theorem  1,  this  shows  the  existence  of  a  non-null  rational  offer 
group  without  d. 

2.  30/(0;  E  R0(p,t)  A  d  E  Oj}\  All  other  agents  are  rational  (by  assumption),  and  any 

— * 

rational  offer  group  that  includes  d  also  includes  c  (3);  thus,  if  P,  is  any  offer  group 
containing  d,  then  pay(i,rrii,Oi)  >  pay(i,m,,  Px)  where  Oi  =  {c  :  p(*,c)  >  p(t,d)}. 


This,  along  with  Theorem  1,  shows  the  existence  of  &  rational  offer  group  without  d. 
□ 

Because  of  Theorem  5,  a  rational  agent  interacting  with  other  rational  agents  knows 
that  he  need  not  offer  a  move  that  is  dominated  for  all  players — doing  so  cannot  increase 
his  payoff.  If  the  other  rational  agents  also  know  that  all  agents  are  rational,  they  too 
will  realize  that  they  can  refrain  from  offering  a  move  that  is  dominated  for  all  players. 
Higher  levels  of  knowledge  [13],  such  as  their  knowing  that  all  agents  know  that  all  agents 
are  rational,  are  not  needed.  In  addition,  because  of  the  definition  of  rational  offer  groups 
(3),  the  agents  can  refrain  from  offering  any  moves  with  smaller  payoffs,  since  those  groups 
would  necessarily  include  the  dominated  move. 

§5.  Examples 

We  will  now  examine  the  consequences  of  our  rational  offer  theorems  in  several  addi¬ 
tional  types  of  games. 

5.1  Best  Plan 

The  best  plan  scenario  is  reflected  in  the  following  matrix: 


A 

B 

A 

7 

4 

B 

5 

6 

All  agents  recognize  that  there  is  a  single  best  move;  how  will  their  offer  groups  reflect 
this?  From  Theorem  1,  a  rational  agent  knows  that  he  can  safely  offer  the  move  that  gives 
him  his  best  payoff  (i.e.,  move  A/ A),  even  assuming  minimal  deal  rationality  on  the  part 
of  other  players  (though  the  theorem  is  noncommittal  as  to  whether  other  moves  can  or 
should  be  included  with  it).  All  players  can  also  rule  out  move  A/B  using  Theorem  2  if 
unique  deal  rationality  holds  (since  A/B  yields  the  lowest  payoff).  If  there  is  an  assumption 
of  individual  deal  rationality,  Theorem  5  can  guarantee  each  agent  that  the  offer  group 
consisting  solely  of  A) A  is  rational.  Communication  thus  allows  coordination  on  the  best 
plan  under  more  intuitive  assumptions  about  the  interaction  than  those  used  in  [8]. 


S.2  Breaking  Symmetries — Multiple  Best  Plan 

Our  rational  offer  group  theorems  allow  us  to  solve  the  “Multiple  Best  Plans’*  case 
that  could  not  be  solved  in  [8].  The  following  matrix  illustrates  the  scenario: 


A 

B 

A 

-1 

2 

B 

2 

-1 

Assuming  minimal  deal  rationality,  an  agent  can  rationally  offer  B/A  and  A/B.  In  addi* 
tion,  assuming  unique  deal  rationality  an  agent  knows  that  he  can  rationally  not  offer  A/ A 
and  B/B  (since  they  are  lowest  yield  moves).  This  analysis  can  be  done  by  both  agents 
if  they  are  rational  and  operating  under  the  unique  deal  assumption.  Their  offer  sets  will 
overlap  on  the  multiple  best  outcomes;  selection  of  a  single  alternative  from  the  multiple 
agreements  then  occurs. 

5.3  Prisoner’s  Dilemma 

The  prisoner’s  dilemma  is  represented  by  the  following  matrix: 

IT 

JD 

Each  agent  most  desires  to  play  D  while  the  opponent  plays  C,  then  to  play  C  along  with 
the  opponent,  then  to  play  D  along  with  the  opponent,  and  least  of  all  to  play  C  while 

i 

the  opponent  plays  D.  The  dilemma  comes  about  because  case  analysis  implies  that  it  is 
always  better  to  play  D\  both  players  choosing  D,  however,  is  less  desirable  for  both  than 
if  they  had  chosen  C.  The  dilemma  has  received  much  attention  within  the  philosophy  and 
game  theory  literature  [2,  5,  22,  26].  In  the  usual  presentation  of  the  prisoner’s  dilemma, 
playing  C  is  called  “cooperating,”  and  playing  D  is  called  “defecting.”  With  the  presence 
of  communication,  in  fact,  there  is  no  dilemma: 

Corollary  6  (Prisoner’s  Dilemma).  If  all  players  know  that  all  players  are  operating 
under  the  assumption  of  individual  deal  rationality,  agents  will  cooperate  in  the  prisoner's 
dilemma. 


0 


D 


0/5 


5/0 
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Proof.  The  first  player  knows  that  it  is  rational  to  offer  D/C  (since  it  is  rational  even 
under  minimal  rationality,  Theorem  1);  he  also  knows  it  is  irrational  to  offer  C/D  (from 
Theorem  2,  since  individual  deal  rationality  includes  unique  deal  rationality).  By  Theo¬ 
rem  5,  there  is  a  rational  offer  group  without  D/D.  Now  he  knows  that  the  other  agent 
will  not  offer  D/C  (since  the  other  agent  is  assumed  rational  and  operating  under  the 
assumption  of  unique  deal  rationality,  Theorem  2).  Since  D/C  will  certainly  not  be  met, 
pay[i,D,  {D/C})  <  pay(ttD,{D/C,C/C}).  Thus,  the  offer  group  {D/C, C/C}  is  ratio¬ 
nal.  The  second  agent  will,  if  rational  and  working  under  the  same  assumptions,  come  to 
the  same  conclusion.  The  deal  C/C  will  be  struck,  and  the  agents  avoid  the  D/D  trap. 
□ 

§6.  Extending  the  Model 

For  certain  types  of  interactions,  the  model  presented  above  (i.e.,  the  various  assump¬ 
tions  and  theorems  about  rational  moves  and  deals)  does  not  specify  rational  activity  in 
sufficient  detail.  We  can  extend  the  model  in  a  variety  of  ways  to  handle  these  cases,  and  at 
the  same  time  capture  a  wider  range  of  assumptions  about  the  interaction.  In  this  section, 
we  briefly  present  some  of  the  extensions  that  might  be  made  to  our  original  model. 

6.1  Similar  bargainers 

Consider  the  following  payoff  matrix  (equivalent  to  game  77  in  Rapoport  and  Guyer’s 
taxonomy  [23]) 


A 

B 

A 

3 

2 

B 

5/0 

0/5 

Assuming  separate  deal  rationality,  the  first  player  can  assume  that  D/A  should  be 
in  a  rational  offer  group  of  his,  and  that  D/B  should  not  be.  What  else  can  be  said  about 
what  constitutes  a  rational  offer  group  in  this  game?  There  are  three  choices,  namely 
{B/A},  {A/ A,B / A),  and  {A/B,A/A,  B/A).  In  order  to  decide  among  the  choices,  we 
would  like  to  make  more  assumptions  about  the  "bargaining  tendencies”  of  the  other  agent 
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8.2  Stochastic  Model — The  Game  of  Chicken 


Note,  however,  the  following  payoff  matrix  (commonly  known  as  the  game  of  chicken 
[23]): 


A 

B 

A 

3 

2/5 

B 

5/2 

1 

Two  agents,  even  if  they  assume  individual  deal  rationality  and  the  similar  bargainers 
assumption,  will  be  faced  with  the  following  choices:  a  payoff  of  {3}  or  a  payoff  of  {2, 3, 5}. 
According  to  our  definitions,  neither  of  these  sets  dominates  the  other,  and  it  is  not  clear 
how  to  decide  between  them. 

If,  however,  we  extend  the  model  to  include  a  probabilistic  choice  from  within  the 
agreement  set,  it  is  clear  that  the  latter  agreement  set  dominates  the  former  (with  an 
expected  value  of  3.33  versus  3).  This  would  correspond  to  the  fair  arbiter  mentioned 
earlier  tossing  an  n-sided  coin  to  decide  among  the  n  members  of  an  agreement  set.  The 
usual  understanding  of  "utility”  also  supports  viewing  the  payoffs  in  this  way. 

A  further  stochastic  extension  to  our  model  would  allow  moves  themselves  to  be 
specified  probabilistically  (e.g.,  A  with  probability  .5,  and  D  with  probability  .5).  In  the 
game  theory  literature,  this  is  the  distinction  between  pure  strategics  and  mixed  strategies 
[18].  An  analysis  of  this  model  is  beyond  the  scope  of  the  present  discussion. 

6.3  Conjunctive  Offers — Battle  of  the  Sexes 

In  the  game  of  chicken  example  presented  above,  there  was  an  added  complexity  that 
was  temporarily  ignored:  the  possibility  of  "defection.”  If  one  agent  reasons  that  the  other 
agent  will  accept  all  payoffs  above  2,  it  to  the  first  agent’s  benefit  to  only  offer  moves  of 
payoff  5  (this  is  analogous  to  the  prisoner’s  dilemma,  with  the  same  potential  that  both 
players  will  use  identical  reasoning  and  no  agreement  will  be  reached).  A  similar  problem 
can  be  seen  in  the  so-called  battle  of  the  sexes  matrix,  seen  below. 

T 

B 


B 


-1 


1/2 


2/1 


-1 
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(since,  in  fact,  some  agents  might  be  tougher  deal- makers  than  others).  We  will  ignore 
what  value  the  agents  might  place  on  making  a  particular  move  in  the  absence  of  a  deal, 
since  the  payoff  is  underdetermined . 

Let  us  define  two  offer  groups  O,  and  Oj  to  be  similar  if  and  only  if  they  both  have 
the  same  lower  boundary  for  what  deals  are  included  or  not  included,  similar  (Oi,Oj)  is 
true  if  and  only  if 

3n[p(t, d)  >  n  O  d  €.  Oi  A  p(j, c )  >  n  <=>  c  €  Oj] 

for  some  number  n. 

It  might  seem  that,  since  both  players’  payoffs  are  designated  in  numbers,  it  is  rea¬ 
sonable  to  compare  their  utilities  (e.g.,  player  A  values  a  payoff  of  4  more  than  player 
B  values  a  payoff  of  3).  In  fact,  this  should  not  be  taken  for  granted,  and  utility  theory 
does  not  ordinarily  allow  such  a  comparison  to  be  made.  Nevertheless,  if  we  use  the  sim¬ 
ilar  bargainers  definition,  we  implicitly  assume  some  meaningful  measure  for  comparing 
inter-personal  utility. 

One  assumption  to  use  in  deciding  upon  rational  offer  groups  is  now  that  the  other 
agent  will  accept  deals  that  you  would  accept;  that  is,  Oj  €  R„{p,j)  <=>  <3,  G  R0{p,i)  where 
similar  (Oi,Oj). 

Under  this  assumption,  we  can  decide  what  deal  is  rational  in  the  above  game.  Player 
1  reasons  that  if  he  offers  {B/A},  player  2  (who  is  a  similar  bargainer)  will  offer  only 
{B/B}.  There  will  be  no  match.  In  the  same  way,  if  it  would  be  rational  for  player 
1  to  offer  {A/A,  B/A)  then  player  2  will  offer  {A/ A,  B/B),  with  an  agreement  on  A/ A 
and  a  payoff  of  {3}  for  both.  If  player  1  offers  {A/ B,  Aj A,  B j A}  then  player  2  will  offer 
{A/B,A/A,  B/B}  and  there  will  be  agreement  on  A/A  and  on  A/B,  with  a  payoff  of  {2,3} 
for  both.  Since  {3}  dominates  (2,3),  agents  who  assume  common  knowledge  [13]  of  the 
similar  bargainer  assumption  should  choose  the  rational  offer  group  that  yields  agreement 


One  approach  to  solving  this  problem  is  to  allow  “composite”  offers,  for  example,  an 
offer  consisting  of  a  conjunct  of  several  moves  (the  conjunct  must  be  matched  exactly  in 
order  for  a  deal  to  occur).  Thus,  the  offer  consisting  of  A/B  A  B/A  can  consistently  be 
made  by  both  agents  without  the  potential  of  defection  (and  with  an  expected  utility  of 
1.5  for  each).  This  notion  can  be  extended  to  general  logical  offers  consisting  of  disjuncts, 
conjuncts  and  negations  of  joint  moves.  The  battle  of  the  sexes  can  thus  be  uniquely  solved 
with  the  assumption  of  similarity  in  bargaining,  if  conjunctive  offers  axe  allowed. 

6.4  Repeated  Interactions 

The  technique  of  allowing  conjunctive  offers,  presented  above,  in  fact  transforms  the 
original  game  into  a  new  supergame  that  can  then  be  approached  using  our  previous 
analysis.  A  similar  technique  can  be  used  when  agents  will  be  participating  in  a  fixed 
number  of  interactions,  each  with  known  payoff  characteristics. 

Imagine  that  there  are  n  games,  Pi,P2>  •  ■  •  ,Pn-  To  each  game  j  corresponds,  for  player 
t,  a  set  of  moves  M/ .  We  now  construct  a  new  game  whose  moves  for  each  player  are  the 
crossproduct  of  his  moves  in  the  individual  games: 

l 

and  whose  outcomes  are  likewise  crossproducts  of  the  individual  games’  outcomes.  Of 
course,  the  player’s  utility  from  any  of  these  supergame  outcomes  is  some  function  of  his 
utilities  from  its  constituent  outcomes;  summing  the  utilities  seems  reasonable,  although 
utility  theory  does  not  actually  sanction  such  an  approach. 

6.5  Specifying  Partial  Outcomes 

We  would  like  to  generalize  the  analysis  of  deal-making  and  promises  to  the  case  where 
agents  need  not  specify  complete  outcomes  (i.e. ,  a  joint  move  for  all  players)  in  their  offer 
group.  In  this  section  we  will  briefly  note  some  consequences  of  generalizing  the  types  of 
offers  that  can  be  made. 

Let  us  say  that  each  agent  is  able  to  specify  an  offer  group  of  partial  moves  that 
specify  outcomes  the  agent  is  willing  to  accept.  If  a  move  which  is  offered  by  one  agent 


is  likewise  accepted  by  all  other  agents  in  the  relevant  set,  those  agents  are  all  required 
to  carry  out  the  move.  Multiple  agreements  again  result  in  a  set  of  moves;  in  practice  (as 
above),  a  single  element  of  this  set  would  be  selected  by  a  fair  arbiter.  A  partial  move  r*s 
for  agent  i  is  therefore  a  vector  that  specifies  a  move  for  some  set  S  of  agents,  *  must  be 
an  element  of  5,  but  may  be  its  only  element.  Formally,  if  r 5  is  a  partial  move, 

r's  c  Mi  x  Ms~i  =  Ms- 

The  payoff  vector  p  is  defined  over  partial  moves  as 

P(*>s)  =  {p(*\0  :  r  =  r*s  x  M§}. 

We  propose  a  new  restriction  on  offer  groups  in  the  current  model  that  did  not  exist 
previously:  there  can  be  no  “contradictions”  among  an  agent’s  offers.  That  is,  we  must 
assume  tb»t  there  are  no  two  partial  moves  that  are  specified  in  the  offer  group  such  that, 
if  both  were  accepted,  the  agent  could  only  satisfy  one.  This  problem  did  not  arise  above 
because  all  agents  were  bound  by  every  offer.  Thus,  it  was  impossible  that  two  different 
agents  could  be  promised  contradictory  moves  by  an  agent  without  themselves  taking  part 
in  accepting  all  of  those  moves.  Once  the  assumption  is  made  that  an  agent  will  not 
promise  contradictory  partial  moves,  the  analysis  is  similar  to  that  developed  above. 

A  further  complication,  however,  arises  in  specifying  the  secondary  payoff  function 
pay (*,m, ,£),)  within  this  new  model;  in  particular,  what  is  t’s  payoff  when  a  deal  has 
been  reached  by  a  set  of  agents  that  does  not  include  him?  There  are  several  possible 
alternatives,  including  t’s  being  forced  to  make  move  m, ,  or  being  allowed  to  alter  his 
move  based  on  the  agreement  reached  by  the  others.  Analysis  of  the  consequences  these 
assumptions  have  on  strategics  will  be  a  subject  of  future  research,  and  is  outside  the  scope 
of  this  paper. 

There  is,  however,  another  method  for  handling  the  offer  of  partial  moves  that  dote 
allow  our  analysis  to  be  used  directly.  We  could  define  the  offer  of  a  partial  move  as 
implicitly  including  its  extension  in  the  total  move  space  (i.e.,  if  rxs  is  offered  explicitly, 
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the  implicit  offer  includes  all  joint  moves  in  Mp  that  include  r's).  Then,  of  course,  our 
original  analysis  is  completely  appropriate.  Further  work  might  also  consider  the  situation 
(with  total  or  partial  move  offers)  where  deals  consist  of  unions  of  offers  rather  than  their 
intersection. 

§7.  Previous  Work 

The  subject  of  interacting  rational  agents  has  been  addressed  within  the  field  of  ar¬ 
tificial  intelligence  as  well  as  in  the  discipline  of  game  theory.  Here  we  will  briefly  review 
relevant  contributions  from  these  two  areas,  and  contrast  our  present  approach  with  pre¬ 
vious  efforts. 

7.1  Work  in  Artificial  Intelligence 

As  mentioned  above,  researchers  in  distributed  artificial  intelligence  have  begun  to 
address  the  issues  arising  in  multi-agent  interactions.  Lesser  and  Corkill  [4]  have  performed 
empirical  studies  to  determine  cooperation  strategies  with  positive  characteristics  (such  as, 
for  example,  what  types  of  data  should  be  shared  among  distributed  processors).  They  are 
solely  concerned  with  groups  of  agents  who  share  a  common  goal,  but  have  acknowledged 
the  benefit  even  under  this  assumption  of  having  agents  demonstrate  “skepticism”  (i.e., 
not  being  distracted  by  others’  information). 

Georgcff  [10,  11]  has  developed  a  formal  model  to  combine  separate  plans  of  indepen¬ 
dent  agents.  The  primary  concern  is  to  avoid  destructive  interference  caused  by  simulta¬ 
neous  access  to  a  shared  resource.  The  model  used  assumes  that  the  agents  have  separate 
goals,  but  that  these  goals  do  not  directly  loppose  one  another.  Cooperative  action  is  nei¬ 
ther  required  nor  exploited,  except  insofar  as  it  allows  agents  to  keep  out  of  each  other's 
way. 

Other  notable  efforts  include  Smith’s  work  on  the  contract  net  [7],  Malone’s  work 
extending  the  contract  net  model  using  economic  theory  [19],  and  the  theoretical  work  on 
knowledge  and  belief  of  Appelt,  Moore,  Konolige,  Ilalpern  and  Moses  [1,  14,  15,  16,  17, 
20,  21]. 
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The  current  work  extends  these  previous  models  of  interaction  by  allowing  a  fuller 
range  of  goal  disagreements  among  agents.  By  using  a  framework  that  captures  total  and 
partial  goal  conflicts,  it  allows  investigation  into  compromise,  promises  and  cooperative 
action.  None  of  these  could  be  handled  using  previous  schemes. 

This  paper  considers  the  communication  scenario  in  ways  similar  to  the  manner  in 
which  previous  work  [8,  9]  investigated  cooperation  among  rational  agents  when  no  com¬ 
munication  occurs.  Below  we  briefly  note  the  advantages  that  were  gained  when  commu¬ 
nication  and  promises  were  added  to  the  interaction  model. 

The  best  plan  interaction  was  handled  in  our  framework  by  assuming  individual  deal 
rationality.  Because  in  the  no-communication  case  this  scenario  could  not  be  solved  using 
individual  move  rationality,  other  assumptions  were  introduced:  informed  rationality  in  [8] 
and  common  rationality  in  [9].  Informed  rationality,  in  our  notation,  constrained  aHowcdm 
in  a  way  that  assumed  each  player  would  respond  rationally  to  the  others’  moves,  whatever 
they  might  be.  For  a  fixed  ms,  the  restricted  game  was  defined  to  be 

P|m,  :  P  x  Ms  -  R  p|m#  (t,  ms)  =  p(»,m). 

Intuitively,  this  was  the  game  where  the  players  in  5  were  assumed  to  make  the  move  Ms- 
Informed  rationality  was  defined  as  follows: 

•  Informed  rationality:  allowedrn(i, m,)  =  7?(p|m.,t).  Each  player  assumes  that  all 

others  will  respond  rationally  to  whatever  move  he  makes. 

It  should  be  noted  in  passing  that  an  assumption  of  common  knowledge  of  rationality 
will  also  allow  for  a  unique  solution  to  the  best  plan  case,  though  this  has  not  been 
previously  pursued  in  the  literature. 

To  solve  the  prisoner’s  dilemma,  even  more  assumptions  had  to  be  introduced.  The 
interested  reader  is  referred  to  [8]  and  [9]  for  full  details. 

Even  using  a  variety  of  assumptions,  previous  work  could  not  handle  the  multiple  best 
plan  case,  where  there  are  several  outcomes  all  equally  recognised  as  best  by  all  players. 
To  break  the  symmetry,  some  communication  is  needed,  though  this  communication  can 
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be  as  simple  as  self-identification  and  reliance  on  a  common  rule  (e.g.,  agent  with  lowest 
name  performs  lowest  ordered  action).  We  were  able  to  solve  the  multiple  best  plan  case  as 
easily  as  the  best  plan  case,  as  well  as  handle  several  novel  interactions  through  extensions 
to  the  model,  such  as  the  similar  bargainers  assumption. 

7.2  Game  Theory 

Game  theory  has  focused  on  a  variety  of  interactions,  and  sought  to  characterize  the 
types  of  actions  that  rational  agents  will  take  in  each.  Many  of  the  same  questions  that 
come  up  in  our  work  have  been  addressed  by  game  theoreticians.  Their  approach,  however, 
has  left  a  great  many  important  issues  unexamined.  Consider  the  following  quote  from  the 
classic  game  theory  text,  [18]: 

Though  it  is  not  apparent  from  some  writings,  the  term  “rational”  is  far  from 
precise,  and  it  certainly  means  different  things  in  the  different  theories  that  have 
been  developed.  Loosely,  it  seems  to  include  any  assumption  one  makes  about 
the  players  maximizing  something,  and  any  about  complete  knowledge  on  the 
part  of  the  player  in  a  very  complex  situation...  [ Games  and  Decisions,  p.  5] 

As  another  example,  consider  the  following  best  plan  interaction: 


A2 

b2 

Ax 

4 

1/2 

Bt 

3/1 

2/3 

It  was  demonstrated  above  that  the  best  plan  case  can  only  be  solved  under  particular 
definitions  of  rationality.  Rapoport  and  Guyer,  however,  writing  in  [23],  put  forward  the 

following  assumption  regarding  agents’  behavior  (citing  the  similarity  with  [25]): 

(A3).  If  a  game  has  a  single  Pareto  equilibrium,  the  players  will  choose  the  strategy 
which  contains  it. . . 

Our  assumption  (A3)  says  that  A  (Aj  is  the  natural  outcome,  which,  of  course, 
is  dictated  by  common  sense. . .  we  shall  refer  to  this  as  a  prominent  solution.  [A 
Taxonomy  of  2x2  Games] 

In  short,  game  theory  has  been  willing  to  take  for  granted  certain  types  of  behavior 
without  carefully  formalizing  its  definitions  of  rationality,  nor  its  assumptions  of  inter-agent 
knowledge. 
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These  questions  are  particularly  important  in  the  field  of  artificial  intelligence.  We  are 
not  interested  in  characterizing  game  matrices:  we  want  to  characterize  agent  rationality 
and  explore  the  consequences  of  various  assumptions.  The  goal  is  to  be  able  to  implement 
intelligent  agents  whose  strategies  of  behavior  will  be  provably  rational. 

§8.  Conclusion 

In  real  world  domains,  intelligent  agents  will  inevitably  need  to  interact  flexibly.  Pre¬ 
vious  work  has  not  modeled  the  full  range  and  complexity  of  agents’  varied  goals.  The 
benevolent  agent  assumption,  which  assumes  that  agents  have  identical  or  non-conflicting 
goals,  has  permeated  previous  approaches  to  distributed  AI. 

This  paper  has  presented  a  framework  for  interaction  that  explicitly  accounts  for 
communication  and  promises,  and  allows  multiple  goals  among  agents.  The  model  provides 
a  unified  solution  to  a  wide  range  of  problems,  including  the  types  of  interactions  discussed 
in  [8]  and  [9].  Through  the  use  of  communication  and  binding  deals,  agents  are  able  to 
coordinate  their  actions  more  effectively,  and  handle  interactions  that  were  previously 

I 

problematical.  By  extending  the  communication  model  even  further,  a  wider  variety  of 
interactions  can  be  handled. 
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